improved alignment of bisulfite sequencing data using cpg islands

نویسندگان

nadia barjaste

reza nadimi

majid alipour

چکیده

dna methylation is an important biological process involving in human disease such as cancer insomia and diabetes. bisulfite sequencing (bs-seq) with next-generation technology is an accurate method for measuring dna methylation. bs-seq data analysis is a considerable way to recognize methylated cytosines and several tools have been developed to analysis bs-seq such as bs-seeker, b-solana, brat, bsmap and etc. in this paper, we propose a novel idea to get more efficiency in the sequencing process, this idea will improve the rate of accuracy in the bsolana alignment tool using a new method in the preprocessing step. our method is based on modification in some regions of dna strand named cpg islands. cpg islands are significant regions in dna strand which frequency of methylated cytosines is less than other cpg contexts. we compared our method with previous methods in the preprocessing of the original bsolana tool using on hg19 reads. the comparison shows that new method provides more ability to align read sequences in the bsolana.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

CpG_MPs: identification of CpG methylation patterns of genomic regions from high-throughput bisulfite sequencing data

High-throughput bisulfite sequencing is widely used to measure cytosine methylation at single-base resolution in eukaryotes. It permits systems-level analysis of genomic methylation patterns associated with gene expression and chromatin structure. However, methods for large-scale identification of methylation patterns from bisulfite sequencing are lacking. We developed a comprehensive tool, CpG...

متن کامل

BS-virus-finder: virus integration calling using bisulfite sequencing data

Background DNA methylation plays a key role in the regulation of gene expression and carcinogenesis. Bisulfite sequencing studies mainly focus on calling single nucleotide polymorphism, different methylation region, and find allele-specific DNA methylation. Until now, only a few software tools have focused on virus integration using bisulfite sequencing data. Findings We have developed a new ...

متن کامل

Whole-genome bisulfite sequencing maps from multiple human tissues reveal novel CpG islands associated with tissue-specific regulation

CpG islands (CGIs) are one of the most widely studied regulatory features of the human genome, with critical roles in development and disease. Despite such significance and the original epigenetic definition, currently used CGI sets are typically predicted from DNA sequence characteristics. Although CGIs are deeply implicated in practical analyses of DNA methylation, recent studies have shown t...

متن کامل

A comprehensive evaluation of alignment software for reduced representation bisulfite sequencing data.

Motivation The rapid development of next-generation sequencing technology provides an opportunity to study genome-wide DNA methylation at single-base resolution. However, depletion of unmethylated cytosines brings challenges for aligning bisulfite-converted sequencing reads to a large reference. Software tools for aligning methylation reads have not yet been comprehensively evaluated, especiall...

متن کامل

Strategies for analyzing bisulfite sequencing data.

DNA methylation is one of the main epigenetic modifications in the eukaryotic genome; it has been shown to play a role in cell-type specific regulation of gene expression, and therefore cell-type identity. Bisulfite sequencing is the gold-standard for measuring methylation over the genomes of interest. Here, we review several techniques used for the analysis of high-throughput bisulfite sequenc...

متن کامل

An alignment algorithm for bisulfite sequencing using the Applied Biosystems SOLiD System

SUMMARY Bisulfite sequencing allows cytosine methylation, an important epigenetic marker, to be detected via nucleotide substitutions. Since the Applied Biosystems SOLiD System uses a unique di-base encoding that increases confidence in the detection of nucleotide substitutions, it is a potentially advantageous platform for this application. However, the di-base encoding also makes reads with m...

متن کامل

منابع من

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید


عنوان ژورنال:
journal of advances in computer research

ناشر: sari branch, islamic azad university

ISSN 2345-606X

دوره 5

شماره 2 2014

میزبانی شده توسط پلتفرم ابری doprax.com

copyright © 2015-2023